Overview

Dataset Statistics

Number of Variables 20
Number of Rows 10127
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 4.9 MB
Average Row Size in Memory 511.2 B
Variable Types
  • Categorical: 10
  • Numerical: 10

Dataset Insights

Months_on_book is skewed
Credit_Limit is skewed
Total_Revolving_Bal is skewed
Avg_Open_To_Buy is skewed
Total_Amt_Chng_Q4_Q1 is skewed
Total_Ct_Chng_Q4_Q1 is skewed
Avg_Utilization_Ratio is skewed
Attrition_Flag has constant length 17
Gender has constant length 1
Dependent_count has constant length 1
Total_Relationship_Count has constant length 1
Months_Inactive_12_mon has constant length 1
Contacts_Count_12_mon has constant length 1
Total_Revolving_Bal has 2470 (24.39%) zeros
Avg_Utilization_Ratio has 2470 (24.39%) zeros
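
The zero percentages in the insights above can be reproduced from the row count and the zero counts. The following is a minimal sketch; the helper name zero_share is hypothetical, and only the figures themselves come from this report.

```python
# Figures copied from the report: total rows and zero counts per variable.
N_ROWS = 10127
ZERO_COUNTS = {"Total_Revolving_Bal": 2470, "Avg_Utilization_Ratio": 2470}


def zero_share(zero_count: int, n_rows: int) -> float:
    """Return the percentage of rows equal to zero, rounded to two decimals."""
    return round(100.0 * zero_count / n_rows, 2)


for name, count in ZERO_COUNTS.items():
    print(f"{name}: {zero_share(count, N_ROWS)}% zeros")
```

Both variables report the same zero count, which may indicate that a zero revolving balance and a zero utilization ratio occur on the same rows; the report itself does not confirm this.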

Variables


Attrition_Flag

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 811.0 KB
  • The most frequent value (Existing Customer) occurs over 5.22 times as often as the second most frequent value (Attrited Customer)

Length

Mean 17
Standard Deviation 0
Median 17
Minimum 17
Maximum 17

Sample

1st row Existing Customer
2nd row Attrited Customer
3rd row Attrited Customer
4th row Existing Customer
5th row Existing Customer

Letter

Count 162032
Lowercase Letter 141778
Space Separator 10127
Uppercase Letter 20254
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Existing Customer, Attrited Customer) account for over 50.0% of rows
  • Attrition_Flag has words of constant length
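
The Letter table labels match the names of Unicode general categories, so the counts can plausibly be reproduced by classifying each character. The sketch below assumes that mapping (Ll, Lu, Zs, Pd, Nd); the function name char_categories is hypothetical, and the report does not state how it derives these counts.

```python
import unicodedata
from collections import Counter

# Assumed mapping from Unicode general categories to the report's labels.
LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
}


def char_categories(values):
    """Count characters by Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            label = LABELS.get(unicodedata.category(ch))
            if label is not None:
                counts[label] += 1
    return counts


# Both observed values have the same character profile.
per_value = char_categories(["Existing Customer"])
print(dict(per_value))

# Scale the per-value profile by the row count from the report.
N_ROWS = 10127
print({label: n * N_ROWS for label, n in per_value.items()})
```

Because every value has length 17 with the same mix of characters, multiplying the per-value counts by the number of rows gives column totals that agree with the table above.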

Customer_Age

numerical

Approximate Distinct Count 45
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 158.2 KB
Mean 46.326
Minimum 26
Maximum 73
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Customer_Age is skewed left (γ1 = -0.0336)

Quantile Statistics

Minimum 26
5-th Percentile 33
Q1 41
Median 46
Q3 52
95-th Percentile 60
Maximum 73
Range 47
IQR 11

Descriptive Statistics

Mean 46.326
Standard Deviation 8.0168
Variance 64.2693
Sum 469143
Skewness -0.0336
Kurtosis -0.2891
Coefficient of Variation 0.1731
  • Customer_Age is not normally distributed (p-value 4.053498603070004e-05)
  • Customer_Age has 2 outliers
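
Several of the Customer_Age figures follow directly from others in the same tables. The sketch below recomputes them from the reported values. The Tukey fences (1.5 times the IQR beyond the quartiles) are an assumed outlier rule used for illustration only; the report does not state which rule produced its outlier count.

```python
# Values copied from the Customer_Age tables above.
N_ROWS = 10127
total, mean, std = 469143, 46.326, 8.0168
minimum, q1, q3, maximum = 26, 41, 52, 73

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
mean_from_sum = total / N_ROWS    # Mean recomputed from Sum and row count

# Assumption: Tukey fences as the outlier rule (not confirmed by the report).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(value_range, iqr, round(cv, 4), round(mean_from_sum, 3))
print(lower_fence, upper_fence)
```

Under this assumed rule only ages above the upper fence would be flagged, since the minimum of 26 lies inside the lower fence. That is compatible with the small outlier count reported, but it does not prove the report uses this rule.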

Gender

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 652.7 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row F
2nd row M
3rd row M
4th row F
5th row F

Letter

Count 10127
Lowercase Letter 0
Space Separator 0
Uppercase Letter 10127
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (F, M) account for over 50.0% of rows
  • Gender has words of constant length

Dependent_count

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 652.7 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 3
2nd row 0
3rd row 3
4th row 2
5th row 2

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 10127
  • The top 2 categories (3, 2) account for over 50.0% of rows
  • Dependent_count has words of constant length

Education_Level

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 731.2 KB
  • The most frequent value (Graduate) occurs over 1.55 times as often as the second most frequent value (High School)

Length

Mean 8.9393
Standard Deviation 1.7501
Median 8
Minimum 7
Maximum 13

Sample

1st row High School
2nd row Unknown
3rd row Doctorate
4th row Uneducated
5th row Uneducated

Letter

Count 87999
Lowercase Letter 75343
Space Separator 2013
Uppercase Letter 12656
Dash Punctuation 516
Decimal Number 0
  • The top 2 categories (Graduate, High School) account for over 50.0% of rows
  • The most frequent word (graduate) occurs over 1.55 times as often as the second most frequent word (high)

Marital_Status

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 708.9 KB

Length

Mean 6.6845
Standard Deviation 0.6031
Median 7
Minimum 6
Maximum 8

Sample

1st row Married
2nd row Single
3rd row Divorced
4th row Single
5th row Married

Letter

Count 67694
Lowercase Letter 57567
Space Separator 0
Uppercase Letter 10127
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Married, Single) account for over 50.0% of rows

Income_Category

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 756.4 KB
  • The most frequent value (Less than $40K) occurs over 1.99 times as often as the second most frequent value ($40K - $60K)

Length

Mean 11.4801
Standard Deviation 2.4478
Median 12
Minimum 7
Maximum 14

Sample

1st row Less than $40K
2nd row $40K - $60K
3rd row $80K - $120K
4th row Less than $40K
5th row Unknown

Letter

Count 50014
Lowercase Letter 31599
Space Separator 17303
Uppercase Letter 18415
Dash Punctuation 4727
Decimal Number 29746
  • The top 2 categories (Less than $40K, $40K - $60K) account for over 50.0% of rows

Card_Category

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 683.5 KB
  • The most frequent value (Blue) occurs over 17.0 times as often as the second most frequent value (Silver)

Length

Mean 4.1175
Standard Deviation 0.4869
Median 4
Minimum 4
Maximum 8

Sample

1st row Blue
2nd row Blue
3rd row Blue
4th row Blue
5th row Blue

Letter

Count 41698
Lowercase Letter 31571
Space Separator 0
Uppercase Letter 10127
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Blue, Silver) account for over 50.0% of rows
  • The most frequent word (blue) occurs over 17.0 times as often as the second most frequent word (silver)

Months_on_book

numerical

Approximate Distinct Count 44
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 158.2 KB
Mean 35.9284
Minimum 13
Maximum 56
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Months_on_book is skewed left (γ1 = -0.1065)

Quantile Statistics

Minimum 13
5-th Percentile 22
Q1 31
Median 36
Q3 40
95-th Percentile 50
Maximum 56
Range 43
IQR 9

Descriptive Statistics

Mean 35.9284
Standard Deviation 7.9864
Variance 63.7828
Sum 363847
Skewness -0.1065
Kurtosis 0.3993
Coefficient of Variation 0.2223
  • Months_on_book is not normally distributed (p-value 3.1233315782174637e-22)
  • Months_on_book has 386 outliers

Total_Relationship_Count

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 652.7 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 4
2nd row 3
3rd row 6
4th row 6
5th row 3

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 10127
  • Total_Relationship_Count has words of constant length

Months_Inactive_12_mon

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 652.7 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 3
2nd row 1
3rd row 3
4th row 2
5th row 5

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 10127
  • The top 2 categories (3, 2) account for over 50.0% of rows
  • Months_Inactive_12_mon has words of constant length

Contacts_Count_12_mon

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 652.7 KB

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 3
2nd row 3
3rd row 3
4th row 2
5th row 2

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 10127
  • The top 2 categories (3, 2) account for over 50.0% of rows
  • Contacts_Count_12_mon has words of constant length

Credit_Limit

numerical

Approximate Distinct Count 6205
Approximate Unique (%) 61.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 158.2 KB
Mean 8631.9537
Minimum 1438.3
Maximum 34516
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Credit_Limit is skewed right (γ1 = 1.6665)

Quantile Statistics

Minimum 1438.3
5-th Percentile 1438.51
Q1 2555
Median 4549
Q3 11067.5
95-th Percentile 34516
Maximum 34516
Range 33077.7
IQR 8512.5

Descriptive Statistics

Mean 8631.9537
Standard Deviation 9088.7767
Variance 8.2606e+07
Sum 8.7416e+07
Skewness 1.6665
Kurtosis 1.8075
Coefficient of Variation 1.0529
  • Credit_Limit is not normally distributed (p-value 4.7164067056645815e-12)
  • Credit_Limit has 984 outliers
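
The γ1 values in this report are skewness coefficients: positive when the right tail is longer, negative when the left tail is longer. The sketch below computes the plain moment-based version, m3 / m2^1.5, on made-up toy data. Whether the report applies a bias correction is not stated, so results on the real column may differ slightly from the reported figure.

```python
def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5, without bias correction."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    if m2 == 0:
        raise ValueError("skewness is undefined for constant data")
    return m3 / m2 ** 1.5


# Toy data invented for illustration; not taken from the dataset.
symmetric = [1.0, 2.0, 3.0, 4.0, 5.0]
long_right_tail = [1.0, 1.0, 2.0, 2.0, 3.0, 20.0]

print(skewness(symmetric))
print(skewness(long_right_tail))
```

A variable such as Credit_Limit, whose mean sits well above its median, behaves like the second toy list: a few large values pull the third moment positive.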

Total_Revolving_Bal

numerical

Approximate Distinct Count 1974
Approximate Unique (%) 19.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 158.2 KB
Mean 1162.8141
Minimum 0
Maximum 2517
Zeros 2470
Zeros (%) 24.4%
Negatives 0
Negatives (%) 0.0%
  • Total_Revolving_Bal is skewed left (γ1 = -0.1488)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 359
Median 1276
Q3 1784
95-th Percentile 2517
Maximum 2517
Range 2517
IQR 1425

Descriptive Statistics

Mean 1162.8141
Standard Deviation 814.9873
Variance 664204.3566
Sum 1.1776e+07
Skewness -0.1488
Kurtosis -1.146
Coefficient of Variation 0.7009
  • Total_Revolving_Bal is not normally distributed (p-value 7.182928903938446e-23)

Avg_Open_To_Buy

numerical

Approximate Distinct Count 6813
Approximate Unique (%) 67.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 158.2 KB
Mean 7469.1396
Minimum 3
Maximum 34516
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Avg_Open_To_Buy is skewed right (γ1 = 1.6615)

Quantile Statistics

Minimum 3
5-th Percentile 480.3
Q1 1324.5
Median 3474
Q3 9859
95-th Percentile 32183.4
Maximum 34516
Range 34513
IQR 8534.5

Descriptive Statistics

Mean 7469.1396
Standard Deviation 9090.6853
Variance 8.2641e+07
Sum 7.564e+07
Skewness 1.6615
Kurtosis 1.7971
Coefficient of Variation 1.2171
  • Avg_Open_To_Buy is not normally distributed (p-value 3.0265107438261447e-12)
  • Avg_Open_To_Buy has 963 outliers

Total_Amt_Chng_Q4_Q1

numerical

Approximate Distinct Count 1158
Approximate Unique (%) 11.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 158.2 KB
Mean 0.7599
Minimum 0
Maximum 3.397
Zeros 5
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Total_Amt_Chng_Q4_Q1 is skewed right (γ1 = 1.7318)

Quantile Statistics

Minimum 0
5-th Percentile 0.463
Q1 0.631
Median 0.736
Q3 0.859
95-th Percentile 1.103
Maximum 3.397
Range 3.397
IQR 0.228

Descriptive Statistics

Mean 0.7599
Standard Deviation 0.2192
Variance 0.04805
Sum 7695.919
Skewness 1.7318
Kurtosis 9.988
Coefficient of Variation 0.2885
  • Total_Amt_Chng_Q4_Q1 is not normally distributed (p-value 6.657603209379062e-09)
  • Total_Amt_Chng_Q4_Q1 has 396 outliers

Total_Trans_Amt

numerical

Approximate Distinct Count 5033
Approximate Unique (%) 49.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 158.2 KB
Mean 4404.0863
Minimum 510
Maximum 18484
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Total_Trans_Amt is skewed right (γ1 = 2.0407)

Quantile Statistics

Minimum 510
5-th Percentile 1283.3
Q1 2155.5
Median 3899
Q3 4741
95-th Percentile 14212
Maximum 18484
Range 17974
IQR 2585.5

Descriptive Statistics

Mean 4404.0863
Standard Deviation 3397.1293
Variance 1.154e+07
Sum 4.46e+07
Skewness 2.0407
Kurtosis 3.8915
Coefficient of Variation 0.7714
  • Total_Trans_Amt is not normally distributed (p-value 1.1258798706703985e-05)
  • Total_Trans_Amt has 896 outliers

Total_Trans_Ct

numerical

Approximate Distinct Count 126
Approximate Unique (%) 1.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 158.2 KB
Mean 64.8587
Minimum 10
Maximum 139
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Total_Trans_Ct is skewed right (γ1 = 0.1537)

Quantile Statistics

Minimum 10
5-th Percentile 28
Q1 45
Median 67
Q3 81
95-th Percentile 105
Maximum 139
Range 129
IQR 36

Descriptive Statistics

Mean 64.8587
Standard Deviation 23.4726
Variance 550.9616
Sum 656824
Skewness 0.1537
Kurtosis -0.3676
Coefficient of Variation 0.3619
  • Total_Trans_Ct has 2 outliers

Total_Ct_Chng_Q4_Q1

numerical

Approximate Distinct Count 830
Approximate Unique (%) 8.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 158.2 KB
Mean 0.7122
Minimum 0
Maximum 3.714
Zeros 7
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • Total_Ct_Chng_Q4_Q1 is skewed right (γ1 = 2.0637)

Quantile Statistics

Minimum 0
5-th Percentile 0.368
Q1 0.582
Median 0.702
Q3 0.818
95-th Percentile 1.069
Maximum 3.714
Range 3.714
IQR 0.236

Descriptive Statistics

Mean 0.7122
Standard Deviation 0.2381
Variance 0.05668
Sum 7212.676
Skewness 2.0637
Kurtosis 15.681
Coefficient of Variation 0.3343
  • Total_Ct_Chng_Q4_Q1 is not normally distributed (p-value 3.819978010152317e-09)
  • Total_Ct_Chng_Q4_Q1 has 394 outliers

Avg_Utilization_Ratio

numerical

Approximate Distinct Count 964
Approximate Unique (%) 9.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 158.2 KB
Mean 0.2749
Minimum 0
Maximum 0.999
Zeros 2470
Zeros (%) 24.4%
Negatives 0
Negatives (%) 0.0%
  • Avg_Utilization_Ratio is skewed right (γ1 = 0.7179)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0.023
Median 0.176
Q3 0.503
95-th Percentile 0.793
Maximum 0.999
Range 0.999
IQR 0.48

Descriptive Statistics

Mean 0.2749
Standard Deviation 0.2757
Variance 0.07601
Sum 2783.847
Skewness 0.7179
Kurtosis -0.7952
Coefficient of Variation 1.0029
  • Avg_Utilization_Ratio is not normally distributed (p-value 2.5608370204922458e-23)

Interactions

Correlations

Missing Values